Abstract

We study properties of rank metric and codes in rank metric over finite fields. We show
that perfect codes do not exist in rank metric. We derive an existence bound that is the
equivalent of the Gilbert-Varshamov (GV) bound in Hamming metric. We study the asymptotic
behavior of the minimum rank distance of codes satisfying GV. We derive the probability
distribution of the minimum rank distance for random codes and random GF(q)-linear codes. We give
an asymptotic equivalent of their average minimum rank distance and show that random
GF(q)-linear codes are on the GV bound for rank metric. We show that the covering density
of optimum codes whose codewords can be seen as square matrices is lower bounded by a
function depending only on the error-correcting capability of the codes. We show that there
are quasi-perfect codes in rank metric over fields of characteristic 2.

1 Introduction

Rank metric over finite fields, used for correcting rank errors in information transmission,
was first introduced by E. M. Gabidulin in 1985 [H]. In this metric, codes of length n can be
viewed either as sets of vectors over some extension field $GF(q^m)$ or, equivalently, as m × n
matrices over the base field $GF(q)$. The rank of a codeword is the usual matrix rank over $GF(q)$
of its matrix representation. This metric is suitable for correcting errors which occur as
matrices of upper-bounded rank. In the same seminal paper, E. M. Gabidulin constructed
an optimal (for the metric) family of linear codes using properties of the Frobenius automorphism
of the extension. They can be considered as the equivalent for rank metric of Reed-Solomon codes.
They have polynomial-time decoding algorithms based on some kind of extended Euclidean
algorithm for the family of linearized polynomials. More recently, in 1991, R. M. Roth designed a
family of matrix codes over GF(2) correcting crisscross errors up to a given rank. These codes
are suitable for correcting errors that occur on rows or columns of matrices, as for example on
tape storage. In fact they form the matrix counterpart of the family of optimal codes published
by E. M. Gabidulin [23]. Other families of codes with polynomial-time decoding algorithms have
been designed by considering the projection of these codes on subfields or on subspaces, or even
by adding some random matrices to the structure of direct product of the codes (RRC codes for
example), see |21lll()j .

Apart from being suitable for correcting crisscross errors as they might occur on tape storage,
rank metric has been involved in the design of coding-theory based public-key cryptosystems with
reduced key size. The main reason is that the decoding complexity of random linear codes in
rank metric is much higher than in Hamming metric for the same set of parameters (compare
the complexity of algorithms described in p[J E2 to the complexity of the algorithm given in
[4] for decoding in Hamming metric). Therefore the size of the public key can be significantly
reduced while keeping the same security level against decoding attacks. This consideration has led to
the design of several public-key cryptosystems relying on this metric, see for example

03 EH EH □ Eld- 
Very recently, rank metric over finite fields found applications in the field of space-time coding,
where the concept of diversity refers to the minimum rank of differences of matrices composing a code

* Pierre.Loidreau@ensta.fr 






over the complex field. Indeed, it has been shown how to construct codes with a given diversity
d from codes over GF(2) with minimum rank distance d, see for example [£2\. Part of this work
was further extended in [15].

All these reasons and also the dispersion of results concerning rank metric motivated our 
interest in the study of its general properties over finite fields, as well as properties of codes in this 
metric. Therefore, this paper not only presents new results but also collects and presents results
that have already been published or are widely known about rank metric over finite fields and
about codes in this metric.

In a first part we recall some basic facts about the metric and some of its properties. We also 
give bounds on the volumes of spheres and balls of the metric. This result is used in the proofs 
further in the paper. Concerning the codes we give the definition of minimum rank distance for 
non-linear and for linear codes. We show that the operation of transposing the code is a
rank-preserving isomorphism. This enables us to deal only with the case where the length of the code
is at most the extension degree of the code. If this is not the case, one simply takes the transposed code.

Then, on the same model as in Hamming metric, we define the classical sphere-packing and 
Singleton bound. Thanks to the sphere-packing inequality and to the approximations on the 
volume of balls, we show that no perfect codes exist in rank metric. 

We prove a Gilbert-Varshamov-like bound on the existence of codes with given parameters, and
we derive an asymptotic equivalent of the minimum rank distance of a code said to be on GV.



Concerning random codes, we evaluate their average minimum rank distance and show that, in
well-chosen cases, the probability that the effective minimum distance deviates from the average
case tends to 0 asymptotically. More precisely, we show that random GF(q)-linear codes are
asymptotically on GV.

In the last part, we study codes that satisfy the Singleton inequality. They are called MRD- 
codes for Maximum Rank Distance codes. After recalling the formula given by Gabidulin on the 
rank distribution of linear MRD-codes, we present some simulations showing that rank distribution 
of random codes and of MRD-codes is very similar. In addition, we prove that the density of 
correctable errors for MRD-codes corresponding to codes formed with square matrices is lower 
bounded by a function depending only on the error-correcting capability of the code. In the 
special case where one considers fields of characteristic 2, we show that even if there are no perfect
codes in rank metric, quasi-perfect codes exist, that is, there exists a family of codes which are
asymptotically perfect.

2 Properties of rank metric 

In the rest of the paper the code alphabet will be the finite field $GF(q^m)$ with $q^m$ elements, where
q is a power of some prime. Let $b = (\beta_1, \ldots, \beta_m)$ be a basis of $GF(q^m)$ over $GF(q)$. The
integer n will denote the length of the code. Thus vectors of the ambient space $GF(q^m)^n$ will
be considered interchangeably as vectors with components in $GF(q^m)$ or as m × n q-ary matrices
obtained by projecting the elements of $GF(q^m)$ on $GF(q)$ with respect to the basis b.
The rank norm of a vector of $GF(q^m)^n$ is defined by

Definition 1 (0) 

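Let $x = (x_1, \ldots, x_n) \in GF(q^m)^n$. The rank of x on $GF(q)$ is the rank of the matrix

\[ X = \begin{pmatrix} x_{11} & \cdots & x_{1n} \\ \vdots & \ddots & \vdots \\ x_{m1} & \cdots & x_{mn} \end{pmatrix}, \]

where $x_j = \sum_{i=1}^{m} x_{ij}\beta_i$. It is denoted by $Rk(x \mid GF(q))$, or more simply by $Rk(x)$ whenever there is
no ambiguity.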



Rank metric is the metric over $GF(q^m)^n$ induced by the rank norm. Given a vector $x \in GF(q^m)^n$,
spheres and balls in rank metric have the following expressions:

• Sphere of radius $t \ge 0$ centered on x: $\mathcal{S}(x, t) \stackrel{def}{=} \{y \in GF(q^m)^n \mid Rk(y - x) = t\}$.

• Ball of radius $t \ge 0$ centered on x: $\mathcal{B}(x, t) \stackrel{def}{=} \bigcup_{i=0}^{t} \mathcal{S}(x, i)$.

Since rank metric is invariant by translation of vectors, the volumes of spheres and balls do not
depend on the chosen center. Therefore, to simplify notations, we define

• $S_t \stackrel{def}{=}$ volume of a sphere of radius t in $GF(q^m)^n$. It is equal to the number of m × n q-ary
matrices of rank exactly t. If t = 0 then $S_0 = 1$, and for t = 1, ..., min(n, m) it is equal (see for
example [Q)) to

\[ S_t = \prod_{j=0}^{t-1} \frac{(q^n - q^j)(q^m - q^j)}{q^t - q^j}. \tag{2.1} \]

• $B_t \stackrel{def}{=}$ volume of a ball of radius t in $GF(q^m)^n$. It is equal to the number of m × n matrices of
rank at most t over $GF(q)$. Therefore

\[ \forall t = 0, \ldots, \min(n, m), \quad B_t = \sum_{i=0}^{t} S_i. \tag{2.2} \]
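As an illustration of formulas (2.1) and (2.2), the following Python sketch computes the sphere and ball volumes with exact integer arithmetic and compares them with a brute-force enumeration over GF(2). The function names (`sphere_volume`, `ball_volume`, `gf2_rank`, `brute_force_counts`) are hypothetical helpers introduced here for illustration only, and the brute-force check is restricted to q = 2 and very small matrices.

```python
from itertools import product


def sphere_volume(q, m, n, t):
    """Number of m x n matrices over GF(q) of rank exactly t, as in (2.1)."""
    if t < 0 or t > min(m, n):
        return 0
    num, den = 1, 1
    for j in range(t):
        num *= (q**n - q**j) * (q**m - q**j)
        den *= q**t - q**j
    return num // den  # the quotient is an exact integer


def ball_volume(q, m, n, t):
    """Number of m x n matrices over GF(q) of rank at most t, as in (2.2)."""
    return sum(sphere_volume(q, m, n, i) for i in range(t + 1))


def gf2_rank(rows):
    """Rank over GF(2) of a matrix whose rows are given as integer bitmasks."""
    rows = list(rows)
    rank = 0
    while rows:
        pivot = rows.pop()
        if pivot:
            rank += 1
            low = pivot & -pivot  # lowest set bit is used as pivot column
            rows = [r ^ pivot if r & low else r for r in rows]
    return rank


def brute_force_counts(m, n):
    """Count all binary m x n matrices by rank (only feasible for tiny m, n)."""
    counts = [0] * (min(m, n) + 1)
    for rows in product(range(2**n), repeat=m):
        counts[gf2_rank(rows)] += 1
    return counts
```

For instance, `[sphere_volume(2, 3, 3, t) for t in range(4)]` can be compared with `brute_force_counts(3, 3)`; the two lists should agree, and the ball of radius min(m, n) should be the whole space.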

2.1 Bound on spheres and balls in rank metric 

The quantities (2.1) and (2.2) being not very convenient to handle, we derive the following
bounds, which are not very tight but are sufficient and easy to handle in the rest of the paper:

Proposition 1

For all t = 0, ..., min(n, m), we have

\[ \begin{cases} q^{(m+n-2)t - t^2} \le S_t \le q^{(m+n+1)t - t^2}, \\ q^{(m+n-2)t - t^2} \le B_t \le q^{(m+n+1)t - t^2 + 1}. \end{cases} \tag{2.3} \]

Proof. It is clearly true for t = 0. Now, for t = 1, ..., min(n, m), we have

• Volume of spheres: Formula (2.1) can be rewritten

\[ S_t = q^{(m+n)t - t^2} \prod_{j=0}^{t-1} \frac{(1 - q^{j-n})(1 - q^{j-m})}{1 - q^{j-t}}. \]

For all j = 0, ..., t - 1, since $t \le \min(n, m)$ we have

\[ (1 - q^{t-n-1})(1 - q^{t-m-1}) \le (1 - q^{j-n})(1 - q^{j-m}) \le 1, \qquad 1 \le \frac{1}{1 - q^{j-t}} \le \frac{1}{1 - q^{-1}} = \frac{q}{q-1}. \]

For the same reason $(1 - q^{-1}) \le (1 - q^{t - \min(n,m) - 1})$. Therefore

\[ (1 - 2q^{-1} + q^{-2}) \le (1 - q^{t-n-1})(1 - q^{t-m-1}). \]

Since q has to be the power of a prime, we have $q \ge 2$. This implies that $(1 - 2q^{-1}) \ge 0$
and that $q^{-2} \le (1 - q^{t-n-1})(1 - q^{t-m-1})$. Moreover, under the same condition we have
$q/(q - 1) \le q$. Therefore every term of the product satisfies

\[ \forall j = 0, \ldots, t-1, \quad q^{-2} \le \frac{(1 - q^{j-n})(1 - q^{j-m})}{1 - q^{j-t}} \le q. \]

And we obtain the desired result for the volume of the sphere of radius t.
• Volume of balls: Since the volume of a ball is larger than that of a sphere with the same
radius, the lower bound comes directly from the lower bound for spheres. Let L = m + n + 1;
the upper bound on the volume of spheres implies that

\[ B_t \le \sum_{i=0}^{t} q^{Li - i^2} = q^{Lt - t^2} \left( 1 + \sum_{i=0}^{t-1} q^{(L-i-t)(i-t)} \right). \]

For i from 0 to t - 1 we have $L - i - t \ge L - 2t$ and $(i - t) < 0$. Therefore
$q^{(L-i-t)(i-t)} \le q^{(L-2t)(i-t)}$. And by changing variables, we finally obtain

\[ B_t \le q^{Lt - t^2} \left( 1 + \sum_{j=1}^{t} q^{-(L-2t)j} \right). \]

The geometric sum can easily be computed. Namely,

\[ 1 + \sum_{j=1}^{t} q^{-(L-2t)j} = \frac{1 - q^{-(L-2t)(t+1)}}{1 - q^{-(L-2t)}} \le \frac{1}{1 - q^{-(L-2t)}}. \]

Since $t \le \min(m, n)$ we have $L - 2t \ge 1$, and since $q \ge 2$ we get $1/(1 - q^{-(L-2t)}) \le 2 \le q$.
This proves the result for the volume of $B_t$.
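The bounds of Proposition 1 can also be checked numerically. The sketch below is only a sanity check on small parameters, not a proof; `sphere_volume` and `bounds_hold` are hypothetical helper names, and exact rationals are used because the lower-bound exponent can be negative.

```python
from fractions import Fraction


def sphere_volume(q, m, n, t):
    """Number of m x n matrices over GF(q) of rank exactly t, as in (2.1)."""
    num, den = 1, 1
    for j in range(t):
        num *= (q**n - q**j) * (q**m - q**j)
        den *= q**t - q**j
    return num // den


def bounds_hold(q, m, n):
    """Check both lines of (2.3) for every t = 0, ..., min(m, n)."""
    base = Fraction(q)
    ball = 0
    for t in range(min(m, n) + 1):
        sphere = sphere_volume(q, m, n, t)
        ball += sphere  # B_t = S_0 + ... + S_t
        lower = base ** ((m + n - 2) * t - t * t)
        upper = base ** ((m + n + 1) * t - t * t)
        if not (lower <= sphere <= upper):
            return False
        if not (lower <= ball <= upper * q):
            return False
    return True
```

Calling `bounds_hold(q, m, n)` over a grid of small prime powers q and small m, n is expected to return True everywhere.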



2.2 Minimum rank distance of a code 

A code C of length n and of size M over $GF(q^m)$ is a set of M vectors of length n over $GF(q^m)$.
Its minimum rank distance is defined by

Definition 2

Let C be a code over $GF(q^m)$. Then $d \stackrel{def}{=} \min_{c_1 \ne c_2 \in C} Rk(c_1 - c_2)$ is called the minimum rank
distance of C.

If the code is GF(q)-linear (this is most often the case when it is considered as a matrix code) or
even linear, then, since rank metric is invariant by translation, the minimum rank distance of the
code is

\[ d = \min_{c \ne 0 \in C} Rk(c). \tag{2.4} \]

If d is the minimum rank distance of C we will say that C is a $(n, M, d)_r$-code. Moreover, if the
code is linear of dimension k we will say that it is a $[n, k, d]_r$-code.
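To make Definition 2 and formula (2.4) concrete, here is a small Python sketch over GF(2), where a codeword is stored as a tuple of row bitmasks. The representation and the helper names (`gf2_rank`, `rank_distance`, `minimum_rank_distance`) are assumptions made for this illustration. The example code is the matrix representation of GF(4) as 2 × 2 binary matrices, chosen here because every nonzero element is invertible.

```python
from itertools import combinations


def gf2_rank(rows):
    """Rank over GF(2) of a matrix whose rows are given as integer bitmasks."""
    rows = list(rows)
    rank = 0
    while rows:
        pivot = rows.pop()
        if pivot:
            rank += 1
            low = pivot & -pivot
            rows = [r ^ pivot if r & low else r for r in rows]
    return rank


def rank_distance(a, b):
    """Rank of the difference of two binary matrices (difference is XOR over GF(2))."""
    return gf2_rank(x ^ y for x, y in zip(a, b))


def minimum_rank_distance(code):
    """Definition 2: minimum over all pairs of distinct codewords."""
    return min(rank_distance(a, b) for a, b in combinations(code, 2))


# GF(4) = {0, I, A, A + I} with A the companion matrix of x^2 + x + 1.
gf4_code = [(0b00, 0b00), (0b10, 0b01), (0b01, 0b11), (0b11, 0b10)]
d_pairs = minimum_rank_distance(gf4_code)
# For a GF(2)-linear code, (2.4) gives the same value from the nonzero codewords.
d_linear = min(gf2_rank(c) for c in gf4_code if any(c))
```

Both `d_pairs` and `d_linear` are expected to equal 2, so this toy code is a $(2, 4, 2)_r$-code over GF(4).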

2.3 Transposed code 

A $(n, M, d)_r$-code over $GF(q^m)$ can be seen as a $(m, M, d)_r$-code over $GF(q^n)$ by the following
procedure: as in Definition 1, any row vector $x \in GF(q^m)^n$ of rank t corresponds to a q-ary
m × n matrix $X = (x_{ij})$ obtained by extending its components on the chosen basis b of $GF(q^m)/GF(q)$.
Consider the following mapping

\[ T : GF(q^m)^n \to GF(q^n)^m, \quad x \mapsto x^T = \Big( \sum_{j=1}^{n} x_{1j}\gamma_j, \ldots, \sum_{j=1}^{n} x_{mj}\gamma_j \Big), \]

where $\gamma_1, \ldots, \gamma_n$ is a basis of $GF(q^n)/GF(q)$. Clearly the mapping T is a rank-preserving
isomorphism between $GF(q^m)^n$ and $GF(q^n)^m$. By using this mapping we define the notion of transposed
code.
Definition 3 (Transposed code, see [13])

• Let $x \in GF(q^m)^n$; then $x^T = T(x)$ is called the transposed vector of x.

• Let C be a $(n, M, d)_r$-code over $GF(q^m)$; the $(m, M, d)_r$-code over $GF(q^n)$ defined by
$C^T = \{x^T \mid x \in C\}$ is called the transposed code of C.

The definition and properties of transposed codes lead us to consider only the case where
$n \le m$, the other cases being symmetric by taking the corresponding transposed code.

Remark 1

By properties of T, if a code C is GF(q)-linear then its transposed code $C^T$ is also GF(q)-linear.
However, T being only GF(q)-linear and not $GF(q^m)$-linear, if C is linear then $C^T$ has no reason
to be linear.

3 Upper bounds and perfect codes 

In this section we recall a Singleton-like bound for rank metric codes and state an equivalent of
the sphere-packing bound. We show that there are no perfect codes in rank metric.

Theorem 1

Let C be a $(n, M, d)_r$-code over $GF(q^m)$. We have

• Singleton-like bound: $M \le q^{\min(m(n-d+1),\, n(m-d+1))}$.

• Sphere-packing-like bound: if $t = \lfloor (d-1)/2 \rfloor$, then

\[ M \times B_t \le q^{mn}. \tag{3.5} \]

For the proof of the Singleton-like bound see (HIE]. The proof of the sphere-packing bound comes
from the fact that, for rank metric, two balls of radius $t = \lfloor (d-1)/2 \rfloor$ centered on distinct codewords do
not intersect. Thus the union of these balls has size at most that of the whole space. The proof is
identical to the one for codes in Hamming metric.

Now the sphere-packing bound raises a question: if we define a perfect code for rank metric as
a $(n, M, d)_r$-code over $GF(q^m)$ such that $M \times B_t = q^{mn}$, does such a code exist, and for which sets
of parameters?

The following proposition answers the question.

Proposition 2 There are no perfect codes in rank metric. 

Proof. Suppose on the contrary that a perfect code does exist with parameters $(n, M, d)_r$ over
$GF(q^m)$, that is, suppose that

\[ M \times B_t = q^{mn}. \]

Without loss of generality we can assume that $n \le m$ (else consider the transposed code). The
right part of inequality (2.3) on the volume of balls implies that

\[ M\, q^{(m+n+1)t - t^2 + 1} \ge q^{mn}. \]

Moreover, from the Singleton bound we have $M \le q^{m(n-d+1)}$. Since $t = \lfloor (d-1)/2 \rfloor$, this implies that
$M \le q^{m(n-2t)}$. Therefore

\[ q^{(m+n+1)t - t^2 + m(n-2t) + 1} \ge q^{mn}. \]

By taking the base q logarithm of the inequality and by reordering the terms, we obtain

\[ (n - m)t \ge t^2 - t - 1. \]

By hypothesis $n - m \le 0$ and $t > 0$. Therefore we must have $t^2 - t - 1 \le 0$. Since t is an integer, the
only possibility is t = 1 and accordingly n = m. In that case, however, the formula that the parameters
have to satisfy is $M \times B_1 = q^{n^2}$. Hence

\[ q^{n(n-2)}\, \frac{q^{2n} - 2q^n + q}{q - 1} \overset{\text{Singleton}}{\ge} M \left( \frac{q^{2n} - 2q^n + 1}{q - 1} + 1 \right) = q^{n^2}, \]

which implies

\[ 1 - \frac{2}{q^n} + \frac{1}{q^{2n-1}} \ge q - 1. \tag{3.6} \]

This inequality cannot be satisfied for $q \ge 2$.
■
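The argument above combines the Singleton bound with the ball volume. As a numerical sanity check (not a substitute for the proof), the following sketch evaluates the largest packing ratio $M B_t / q^{mn}$ allowed by the Singleton bound when $d \ge 2t + 1$ and $t \ge 1$. The helper names are hypothetical, and the trivial case t = 0 is deliberately excluded.

```python
from fractions import Fraction


def sphere_volume(q, m, n, t):
    """Number of m x n matrices over GF(q) of rank exactly t, as in (2.1)."""
    num, den = 1, 1
    for j in range(t):
        num *= (q**n - q**j) * (q**m - q**j)
        den *= q**t - q**j
    return num // den


def ball_volume(q, m, n, t):
    """Number of m x n matrices over GF(q) of rank at most t, as in (2.2)."""
    return sum(sphere_volume(q, m, n, i) for i in range(t + 1))


def max_packing_ratio(q, m, n, t):
    """Upper bound on M * B_t / q^(mn) using the Singleton bound with d >= 2t + 1."""
    exponent = min(m * (n - 2 * t), n * (m - 2 * t))
    return Fraction(q**exponent * ball_volume(q, m, n, t), q ** (m * n))


def never_perfect(q, max_size):
    """True if the ratio stays strictly below 1 for all 1 <= n, m <= max_size, t >= 1."""
    for m in range(1, max_size + 1):
        for n in range(1, max_size + 1):
            for t in range(1, (min(m, n) - 1) // 2 + 1):
                if max_packing_ratio(q, m, n, t) >= 1:
                    return False
    return True
```

For example, `max_packing_ratio(2, 3, 3, 1)` is 400/512, strictly below 1, in line with Proposition 2.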

Though no perfect codes exist, we will see in Section 6.2 that, in the case where q = 2
and m = n, there exist families of linear codes that are almost perfect, that is, given any integer
/ < 2" 2 , there is a (n, M, d) r code such that / < MB t < 2™ 2 . 



4 A Varshamov-Gilbert like bound 

Until now we have obtained bounds and results on the non-existence of codes in rank metric with
given parameters. What then of the existence of codes? We define an equivalent of the famous
Varshamov-Gilbert bound for the rank metric. An alternative definition was given in ^4], but here
it is more convenient to establish another one for future proofs. We also define what it means
for a code to be on GV, and give an asymptotic equivalent of the bound under some conditions
on m and n.



Proposition 3 

Let m, n, M, d be positive integers. If

\[ M \times B_{d-1} < q^{mn}, \tag{4.7} \]

then there exists a $(n, M + 1, d)_r$-code over $GF(q^m)$.

Proof. Suppose we are given integers n, d and M satisfying inequality (4.7). We prove the
proposition by constructing a code with the required parameters.

• Constructing a code with two elements: we choose randomly a vector $\mathbf{c} \in GF(q^m)^n$. Since
$M \ge 1$, inequality (4.7) gives $GF(q^m)^n \setminus \mathcal{B}(\mathbf{c}, d-1) \ne \emptyset$. Therefore there
is at least one vector $\mathbf{d} \in GF(q^m)^n \setminus \mathcal{B}(\mathbf{c}, d-1)$. The code with two elements formed by $\mathbf{c}$ and
$\mathbf{d}$ has minimum rank distance at least d.

• Induction step: Suppose that we have constructed a $(n, M', d)_r$-code C where $M' \le M$. Let
us consider the set formed by the union of balls of radius d - 1 centered on the codewords:

\[ V = \bigcup_{\mathbf{c} \in C} \mathcal{B}(\mathbf{c}, d-1). \]

This implies in particular that $|V| \le M \times B_{d-1}$. Therefore, from inequality (4.7), there exists
a vector $\mathbf{a} \in GF(q^m)^n \setminus V$. Therefore $C' = C \cup \{\mathbf{a}\}$ is a $(n, M' + 1, d)_r$-code.

■ 
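The constructive proof above can be mimicked by a greedy search. The sketch below is a toy illustration over GF(2) for very small matrices: it scans all binary m × n matrices in a fixed order (instead of choosing vectors at random, which is an assumption of this sketch) and keeps every matrix at rank distance at least d from the codewords already selected. All function names are hypothetical.

```python
def sphere_volume(q, m, n, t):
    """Number of m x n matrices over GF(q) of rank exactly t, as in (2.1)."""
    num, den = 1, 1
    for j in range(t):
        num *= (q**n - q**j) * (q**m - q**j)
        den *= q**t - q**j
    return num // den


def ball_volume(q, m, n, t):
    return sum(sphere_volume(q, m, n, i) for i in range(t + 1))


def gf2_rank(rows):
    """Rank over GF(2) of a matrix whose rows are given as integer bitmasks."""
    rows = list(rows)
    rank = 0
    while rows:
        pivot = rows.pop()
        if pivot:
            rank += 1
            low = pivot & -pivot
            rows = [r ^ pivot if r & low else r for r in rows]
    return rank


def to_rows(x, m, n):
    """Split an (m*n)-bit integer into m rows of n bits each."""
    mask = (1 << n) - 1
    return [(x >> (i * n)) & mask for i in range(m)]


def greedy_code(m, n, d):
    """Greedily collect binary m x n matrices at pairwise rank distance >= d."""
    code = []
    for x in range(2 ** (m * n)):
        if all(gf2_rank(to_rows(x ^ c, m, n)) >= d for c in code):
            code.append(x)
    return code


def gv_guaranteed_size(q, m, n, d):
    """Size M + 1 guaranteed by Proposition 3, with M the largest integer satisfying (4.7)."""
    largest_m = (q ** (m * n) - 1) // ball_volume(q, m, n, d - 1)
    return largest_m + 1
```

With m = n = 3 and d = 2, Proposition 3 guarantees at least `gv_guaranteed_size(2, 3, 3, 2)` codewords; the greedy code cannot be smaller, because the search only stops once the balls of radius d - 1 cover the whole space.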

From this we now deduce another definition: 



Definition 4 

A $(n, M, d)_r$-code is said to be on GV if

\[ (M - 1) \times B_{d-1} < q^{mn} \le M \times B_{d-1}. \tag{4.8} \]

We can wonder whether, as is the case in Hamming metric, we can obtain an asymptotic value for
the minimum rank distance of a code on GV.

Proposition 4 

Consider a $(n, M, d)_r$-code C over $GF(q^m)$ where $m \ge n$. Then, if C is on GV we have

\[ \frac{d}{m+n} \underset{n \to +\infty}{\sim} \frac{1}{2} - \frac{\sqrt{\log_q M}}{m+n} \sqrt{1 + \frac{(m-n)^2}{4 \log_q M}}, \]

provided $mn \ge \log_q M = \lambda(n)(m+n)$, where $\lambda(n) = o(n)$ tends to $+\infty$ with n.

Proof. By taking the base q logarithm of the inequalities (2.3), we obtain from property (4.8)
that

\[ \begin{cases} mn \le (m+n+1)(d-1) - (d-1)^2 + 1 + \log_q M, \\ \log_q(M-1) + (m+n-2)(d-1) - (d-1)^2 < mn. \end{cases} \]

Since $M \ge 2$ we have further that $\log_q(M-1) \ge \log_q M - \log_q 2 \ge \log_q M - 1$. Hence the
minimum distance of the code must satisfy

\[ \begin{cases} 0 \le -d^2 + (m+n+3)d + \log_q M - mn - (m+n+1), \\ 0 > -d^2 + (m+n)d + \log_q M - mn - (m+n). \end{cases} \]

The inequalities are given by second-order equations whose discriminants are respectively

\[ \Delta_1 = (m-n)^2 + 4\log_q M + 2(m+n) + 5, \qquad \Delta_2 = (m-n)^2 + 4\log_q M - 4(m+n). \]

Therefore the minimum distance of a code on GV satisfies the inequalities

\[ \frac{1}{2} + \frac{-\sqrt{\Delta_1} + 3}{2(m+n)} \le \frac{d}{m+n} \le \frac{1}{2} - \frac{\sqrt{\Delta_2}}{2(m+n)}. \]

Under the conditions of the proposition ($\log_q M = \lambda(n)(m+n)$, where $\lambda(n) = o(n)$ tends to
infinity with n), it is not very difficult to complete the proof of the proposition.

■ 

Example 1 

A special case is when m = n, for a family of codes of constant rate 0 < R < 1, that is

\[ \log_q M = n^2 R. \]

In that case we have

\[ \frac{d}{n} \sim 1 - \sqrt{R}. \]

This result implies that the ratio of the minimum rank distance to the length of the code is
asymptotically constant.
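The asymptotic value of Example 1 can be compared with an exact computation. The sketch below is an illustration under explicit assumptions: for a given size $M = q^{\log_q M}$ it returns the smallest d for which $M \times B_{d-1} \ge q^{mn}$, that is, the first distance satisfying the right-hand inequality of (4.8); the helper names are hypothetical.

```python
def sphere_volume(q, m, n, t):
    """Number of m x n matrices over GF(q) of rank exactly t, as in (2.1)."""
    num, den = 1, 1
    for j in range(t):
        num *= (q**n - q**j) * (q**m - q**j)
        den *= q**t - q**j
    return num // den


def gv_distance(q, m, n, log_q_size):
    """Smallest d such that M * B_{d-1} >= q^(mn), where M = q^log_q_size."""
    size = q**log_q_size
    total = q ** (m * n)
    ball = 0
    for t in range(min(m, n) + 1):
        ball += sphere_volume(q, m, n, t)  # ball is now B_t
        if size * ball >= total:
            return t + 1
    return None  # not reached when size >= 1, since B_min(m,n) = q^(mn)


# Square case m = n = 64 over GF(2) with rate R = 1/4, i.e. log_2 M = n^2 / 4.
n = 64
d = gv_distance(2, n, n, n * n // 4)
ratio = d / n  # to be compared with 1 - sqrt(1/4) = 0.5
```

For these parameters the ratio d/n is expected to be close to 1 - sqrt(R) = 0.5, the gap being of order 1/n.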

Remark 2 

Defining an equivalent of the GV bound is less relevant in rank metric than in Hamming metric.
Namely, in Section 6.3 we will see that for any set of parameters codes satisfying the Singleton equality
exist, therefore always exceeding the GV bound. Moreover, they are at least additive, they can easily
be constructed from linear codes and, above all, we know how to construct a polynomial-time decoder.
Therefore this case is radically different from the case of Hamming metric, where no binary MDS
codes exist. Random binary codes are proved to be on GV, which gives a benchmark for
codes that can be constructed.

In rank metric the main interest of establishing the GV bound is to study how good random codes
are with respect to this bound; this is the object of the next section. Another interesting question is
to measure the interval between the minimum rank distance of codes on GV and the Singleton bound.
Bounds were given in \1$ and simulations were made in the case where the ratio n/m is constant
and less than 1.
5 Random codes

Random codes provide benchmarks for the codes that can be constructed. In Hamming metric,
random GF(q)-linear codes are good. They are even often better than families of codes for which
we know polynomial-time decoders. In addition it was shown that linear codes asymptotically
achieve the GV bound. The question is the same concerning rank metric: where are random codes
located compared with the GV bound?

First we define what random codes are.

Definition 5 

• A random code of length n and of size M over $GF(q^m)$ is a set of M distinct uniformly
chosen vectors of $GF(q^m)^n$.

• A random GF(q)-linear code of length n and of size $M = q^K$ is defined by

\[ C = \Big\{ \sum_{j=1}^{K} \lambda_j b_j \;\Big|\; \lambda_j \in GF(q),\ j = 1, \ldots, K \Big\}, \]

where for all j = 1, ..., K, the $b_j \in GF(q^m)^n$ are GF(q)-linearly independent uniformly chosen
vectors.

In this section we first establish the probability distribution of the minimum rank distance
of random codes and of random GF(q)-linear codes. After upper bounding this distribution, we
show that the probability that the minimum rank distance of the code is outside some interval
decreases exponentially as the length of the codes increases. This enables us to prove the
following theorem, which is the main result of the section.

Theorem 2 

Let C be a $(n, M, d)_r$-random code (GF(q)-linear or not) over $GF(q^m)$. Let N = M - 1 if C
is GF(q)-linear and N = M(M - 1)/2 if not. Suppose moreover that $n \le m$ and
$q^{\alpha mn} \ge N \ge 3\log_q(n)\, q^{m+n}$ for some $\alpha < 1$. If

\[ \mathcal{M}_n \stackrel{def}{=} \frac{m+n}{2} - \sqrt{\frac{(m-n)^2}{4} + \log_q N}, \]

then the expectation E(d) and the variance Var(d) of the minimum rank distance of C satisfy, for
$n \to \infty$,

\[ \begin{cases} E(d) = \mathcal{M}_n + O(1), \\ Var(d) = O(\mathcal{M}_n). \end{cases} \]

Since, under the hypotheses of the theorem, $\mathcal{M}_n$ tends to infinity with n, an immediate corollary
of the theorem, obtained by applying the well-known Chebyshev inequality, is

Corollary 1 

Let $(C_n)$ be a family of $(n, M_n, d_n)_r$-random codes over $GF(q^m)$ where

• $m \ge n$,

• $q^{\alpha mn} \ge M_n \ge 3\log_q(n)\, q^{m+n}$ for a fixed $\alpha < 1$.

Then let $\psi$ be an increasing function of n. We have

\[ \Pr\left( |d_n - E(d_n)| \ge \psi(n)\sqrt{\mathcal{M}_n} \right) = O\left( \frac{1}{\psi(n)^2} \right). \]

This corollary shows that, whenever the conditions are satisfied, the minimum distance of random
codes deviates significantly from its expectation only with small probability.

Therefore, in the case of random GF(q)-linear codes, that is, whenever N = M - 1, we deduce
a result analogous to the result in Hamming metric:

Corollary 2 

Under the condition of the previous corollary, random GF(q)-linear codes are on GV. 

5.1 A bound on the minimum rank distance distributions 

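We first deal with random codes and then with random GF(q)-linear codes.

Random codes. Let C be a $(n, M, d)_r$-random code over $GF(q^m)$. We define the following
random variables:

\[ \forall i = 1, \ldots, n, \quad \begin{cases} \mathcal{A}_i = |\{(c_1, c_2) \in C^2 \mid Rk(c_1 - c_2) = i\}|, \\ \mathcal{D}_i = \sum_{j=1}^{i} \mathcal{A}_j = |\{(c_1, c_2) \in C^2,\ c_1 \ne c_2 \mid Rk(c_1 - c_2) \le i\}|. \end{cases} \]

The random variable $\mathcal{A}_i$ counts the number of pairs of codewords at distance exactly equal to
i, whereas the random variable $\mathcal{D}_i$ counts the number of pairs of distinct codewords at distance
at most i. From this, the random variable d specifying the minimum rank distance of C satisfies the
distribution

\[ \forall i = 1, \ldots, n, \quad p_i \stackrel{def}{=} \Pr(d = i) = \Pr(\mathcal{D}_{i-1} = 0,\ \mathcal{D}_i \ge 1). \]

From Bayes' rule we have $\Pr(d = i) = \Pr(\mathcal{D}_i \ge 1 \mid \mathcal{D}_{i-1} = 0) \Pr(\mathcal{D}_{i-1} = 0)$. For all i = 1, ..., n,
the random variable $\mathcal{D}_i$ can be expressed as

\[ \mathcal{D}_i = \sum_{\mathbf{c} \ne \mathbf{d} \in C^2} 1_{Rk(\mathbf{c} - \mathbf{d}) \le i}. \]

Therefore $\mathcal{D}_i$ is equal to 0 if and only if for all $\mathbf{c} \in C$ and $\mathbf{d} \in C$ such that $\mathbf{c} \ne \mathbf{d}$, $Rk(\mathbf{c} - \mathbf{d}) \ge i + 1$.
Since the cardinality of the code is equal to M, and by taking into account the symmetries (i.e.
the fact that $Rk(\mathbf{c} - \mathbf{d}) = Rk(\mathbf{d} - \mathbf{c})$), we have

\[ \Pr(\mathcal{D}_{i-1} = 0) = \left( 1 - \frac{B_{i-1}}{q^{mn}} \right)^{M(M-1)/2}. \tag{5.9} \]

We also have

\[ \Pr(\mathcal{D}_i \ge 1 \mid \mathcal{D}_{i-1} = 0) = 1 - \Pr(\mathcal{D}_i = 0 \mid \mathcal{D}_{i-1} = 0). \]

It is equally clear that, since $\mathcal{D}_i = \mathcal{A}_i + \mathcal{D}_{i-1}$, we have

\[ \Pr(\mathcal{D}_i = 0 \mid \mathcal{D}_{i-1} = 0) = \Pr(\mathcal{A}_i = 0 \mid \mathcal{D}_{i-1} = 0). \]

Since the words of the code are randomly chosen, one can see that

\[ \Pr(\mathcal{A}_i = 0 \mid \mathcal{D}_{i-1} = 0) = \left( 1 - \frac{S_i}{q^{mn} - B_{i-1}} \right)^{M(M-1)/2}. \]

Hence we finally obtain

\[ \Pr(\mathcal{D}_i \ge 1 \mid \mathcal{D}_{i-1} = 0) = 1 - \left( 1 - \frac{S_i}{q^{mn} - B_{i-1}} \right)^{M(M-1)/2}. \tag{5.10} \]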
Random GF(q)-linear codes. The approach for GF(q)-linear codes is the same, except that
GF(q)-linearity modifies the definition of the minimum rank distance of the code, see formula
(2.4). If we set

\[ \forall i = 1, \ldots, n, \quad \begin{cases} \mathcal{A}_i = |\{c \in C \mid Rk(c) = i\}|, \\ \mathcal{D}_i = \sum_{j=1}^{i} \mathcal{A}_j = |\{c \in C \setminus \{0\} \mid Rk(c) \le i\}|, \end{cases} \]

we have

\[ \forall i = 1, \ldots, n, \quad p_i \stackrel{def}{=} \Pr(d = i) = \Pr(\mathcal{D}_{i-1} = 0,\ \mathcal{D}_i \ge 1). \]

Since

\[ \mathcal{D}_i = \sum_{c \in C \setminus \{0\}} 1_{Rk(c) \le i}, \]

and since the vectors $b_j$ forming a GF(q)-basis of C are uniformly chosen, one can check that
$\Pr(Rk(c) \le i \mid c \in C) = \Pr(Rk(c) \le i) = B_i / q^{mn}$. Therefore

\[ \Pr(\mathcal{D}_{i-1} = 0) = \Pr(\forall c \in C \setminus \{0\},\ Rk(c) \ge i) = \left( 1 - \frac{B_{i-1}}{q^{mn}} \right)^{M-1} \tag{5.11} \]

and

\[ \Pr(\mathcal{D}_i \ge 1 \mid \mathcal{D}_{i-1} = 0) = 1 - \left( 1 - \frac{S_i}{q^{mn} - B_{i-1}} \right)^{M-1}. \tag{5.12} \]

From the previous paragraphs, by multiplying (5.9) by (5.10) in the general case, or (5.11) by
(5.12) in the GF(q)-linear case, and since $B_i = B_{i-1} + S_i$, one obtains the following proposition.

Proposition 5 

Let C be a $(n, M, d)_r$ random code over $GF(q^m)$. Let N = M - 1 if C is GF(q)-linear and
N = M(M - 1)/2 if not. Let

\[ \forall i = 1, \ldots, n, \quad p_i = \Pr(d = i). \]

Then we have

\[ \forall i = 1, \ldots, n, \quad p_i = \left( 1 - \frac{B_{i-1}}{q^{mn}} \right)^{N} - \left( 1 - \frac{B_i}{q^{mn}} \right)^{N}, \tag{5.13} \]

where $B_i$ denotes the volume of the ball of radius i in $GF(q^m)^n$.
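Formula (5.13) is easy to evaluate exactly for small parameters. The following sketch uses exact rationals and assumes $n \le m$, so that the ball of radius n is the whole space; the helper names are hypothetical. Note that the sum telescopes to $(1 - q^{-mn})^N$, which gives a convenient consistency check.

```python
from fractions import Fraction


def sphere_volume(q, m, n, t):
    """Number of m x n matrices over GF(q) of rank exactly t, as in (2.1)."""
    num, den = 1, 1
    for j in range(t):
        num *= (q**n - q**j) * (q**m - q**j)
        den *= q**t - q**j
    return num // den


def ball_volume(q, m, n, t):
    return sum(sphere_volume(q, m, n, i) for i in range(t + 1))


def distance_distribution(q, m, n, big_n):
    """List [p_1, ..., p_n] from (5.13); big_n plays the role of N (assumes n <= m)."""
    total = q ** (m * n)
    balls = [ball_volume(q, m, n, i) for i in range(n + 1)]
    return [
        (1 - Fraction(balls[i - 1], total)) ** big_n
        - (1 - Fraction(balls[i], total)) ** big_n
        for i in range(1, n + 1)
    ]


# Random GF(2)-linear code of size M = 2^8 made of 4 x 4 matrices, so N = M - 1.
probabilities = distance_distribution(2, 4, 4, 2**8 - 1)
expected_distance = float(sum(i * p for i, p in enumerate(probabilities, start=1)))
```

The value `expected_distance` can then be compared with the quantity $(m+n)/2 - \sqrt{(m-n)^2/4 + \log_q N}$ of Theorem 2, keeping in mind that the theorem is an asymptotic statement and these parameters are tiny.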

From formula (5.13), and using the fact that for any positive integer N,

\[ \forall a \ge b \ge 0, \quad a^N - b^N = (a - b) \sum_{j=0}^{N-1} a^j b^{N-1-j} \le N (a - b) \left( \max(a, b) \right)^{N-1}, \]

we deduce the following upper bound:

\[ p_i \le \frac{N S_i}{q^{mn}} \left( 1 - \frac{B_{i-1}}{q^{mn}} \right)^{N-1}. \tag{5.14} \]

The following lemma will help to prove Theorem 2 by giving ranges of integers $1 \le i \le n$ where
the probability $p_i$ is upper-bounded by an exponential term.

Lemma 1 

Let C be a $(n, M, d)_r$ random code over $GF(q^m)$. Let N = M - 1 if C is GF(q)-linear and
N = M(M - 1)/2 if not. Let

\[ \forall i = 1, \ldots, n, \quad p_i = \Pr(d = i). \]

Then

• If $1 \le i \le \frac{m+n}{2} - \sqrt{\frac{(m-n)^2}{4} + \log_q N + \frac{m+n}{2} + \lambda(n)}$, then there is a positive constant $C_1$ such that

\[ p_i \le C_1 q^{-\lambda(n)}. \tag{5.15} \]

• If $N \ge q^{m+n}\mu(n)$ and if $n \ge i \ge \frac{m+n}{2} - \sqrt{\frac{(m-n)^2}{4} + \log_q N - (m+n) - \log_q \mu(n)}$, then
there is a positive constant $C_2$ such that

\[ p_i \le e^{-C_2 \mu(n)}. \tag{5.16} \]

Proof. 

To prove this lemma, we upper-bound $p_i$ starting from inequality (5.14).

• The upper bound in (2.3) gives

\[ p_i \le \frac{N S_i}{q^{mn}} \underbrace{\left( 1 - \frac{B_{i-1}}{q^{mn}} \right)^{N-1}}_{\le 1} \le q^{(m+n+1)i - i^2 - mn + \log_q N}. \]

The right part of the inequality is smaller than $q^{-\lambda(n)}$ if and only if
$-i^2 + (m+n+1)i - mn + \log_q N + \lambda(n) \le 0$. The discriminant of this second-order inequality is

\[ \Delta_1 = (m-n)^2 + 1 + 2(m+n) + 4\log_q N + 4\lambda(n). \]

Therefore, if i is smaller than the smallest root of the corresponding equation, that is if

\[ i \le \frac{m+n+1}{2} - \sqrt{\frac{(m-n)^2}{4} + \log_q N + \frac{n+m}{2} + \lambda(n) + \frac{1}{4}}, \]

then by taking $C_1 = q^{1/4}$ the inequality (5.15) is satisfied.
• For the other bound, formula (5.13) gives

\[ p_i \le \left( 1 - \frac{B_{i-1}}{q^{mn}} \right)^{N} \le \left( 1 - \frac{B_{i-1}}{q^{mn}} \right)^{N-1}. \]

Since, by a property of the logarithm function, $\ln(1 - x) \le -x$ for all $0 \le x < 1$, we have

\[ \left( 1 - \frac{B_{i-1}}{q^{mn}} \right)^{N-1} \le e^{-(N-1) B_{i-1} / q^{mn}}. \]

Moreover, if we use the lower bound given in (2.3), we have

\[ -(N-1) \frac{B_{i-1}}{q^{mn}} \le -q^{(m+n-2)(i-1) - (i-1)^2 - mn + \log_q(N-1)}. \]

Hence

\[ \left( 1 - \frac{B_{i-1}}{q^{mn}} \right)^{N-1} \le e^{-\mu(n)}, \]

provided in particular that $q^{(m+n-2)(i-1) - (i-1)^2 - mn + \log_q(N-1)} \ge \mu(n)$. This corresponds to

\[ -i^2 + (m+n)i - (m+n) + 1 - mn + \log_q(N-1) - \log_q \mu(n) \ge 0. \]

The discriminant of the second-order equation is equal to

\[ \Delta_2 = (m-n)^2 + 4 + 4\left( \log_q(N-1) - (m+n) - \log_q \mu(n) \right). \]

Since by hypothesis we have $N \ge q^{m+n}\mu(n)$, we can show that $\Delta_2 \ge 0$, and since there exists
a constant c such that $\log_q(N-1) \ge \log_q N - c$ for all N > 1, for all integers i such that

\[ n \ge i \ge \frac{m+n}{2} - \sqrt{\frac{(m-n)^2}{4} + \log_q N - (m+n) - \log_q \mu(n)}, \]

we also obtain $p_i \le e^{-C_2 \mu(n)}$, where $C_2 = q^{-c}$ is a constant.

■ 

The asymptotic behavior of the upper bounds on the $p_i$ depends on how the parameters m and
N increase when the length n of the code increases. The idea of the proof of Theorem 2 is to consider
positive increasing functions $\lambda$ and $\mu$ that grow sufficiently fast, but not too fast, so that the
probability that the minimum rank distance of the code is equal to some i outside the interval
given by

\[ \frac{m+n}{2} - \sqrt{\frac{(m-n)^2}{4} + \log_q N + \frac{n+m}{2} + \lambda(n)} \le i \le \frac{m+n}{2} - \sqrt{\frac{(m-n)^2}{4} + \log_q N - (m+n) - \log_q \mu(n)} \]

tends asymptotically to 0.

5.2 Proof of the theorem 

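Now we will prove Theorem 2.

Proof of Theorem 2

By definition the expectation of the minimum rank distance is given by

\[ E(d) = \sum_{i=1}^{n} i\, p_i. \]

We shall use the results obtained in Lemma 1 with $\lambda(n) = \mu(n) = 3\log_q n$. This choice is arbitrary
but sufficient for the rest of the proof.

The interest is that, since $\log_q N$ increases faster than (m + n), the contribution of the functions $\lambda$
and $\mu$ under the radical is always negligible compared to the contribution of the other functions.
We define

\[ a_n = \left\lfloor \frac{m+n}{2} - \sqrt{\frac{(m-n)^2}{4} + \log_q N + \frac{m+n}{2} + \lambda(n)} \right\rfloor, \]

\[ b_n = \left\lceil \frac{m+n}{2} - \sqrt{\frac{(m-n)^2}{4} + \log_q N - (m+n) - \log_q \mu(n)} \right\rceil. \]

From the hypotheses, and the choices of $\lambda$ and $\mu$, for sufficiently large n, $a_n \ge 1$ and $b_n \le n$. From
the results of Lemma 1, for all $1 \le i \le a_n$ and $b_n \le i \le n$, there is a constant C such that
$p_i \le C/n^3$. Therefore

\[ \sum_{i=1}^{a_n} i\, p_i \le \frac{C}{n^3} \sum_{i=1}^{a_n} i = O\left(\frac{1}{n}\right), \qquad \sum_{i=b_n}^{n} i\, p_i \le \frac{C}{n^3} \sum_{i=b_n}^{n} i = O\left(\frac{1}{n}\right). \]

Moreover, since we deal only with positive terms, we have

\[ a_n \sum_{i=a_n+1}^{b_n-1} p_i \le \sum_{i=a_n+1}^{b_n-1} i\, p_i \le b_n \sum_{i=a_n+1}^{b_n-1} p_i. \]

By using once more the results of the previous lemma and the fact that $\sum_{i=1}^{n} p_i = 1$, we have

\[ 1 - \frac{C}{n^2} \le \sum_{i=a_n+1}^{b_n-1} p_i \le 1. \]

Hence, if we write $\mathcal{M}_n = \frac{m+n}{2} - \sqrt{\frac{(m-n)^2}{4} + \log_q N}$, then

\[ a_n - \mathcal{M}_n - \frac{C a_n}{n^2} + O\left(\frac{1}{n}\right) \le E(d) - \mathcal{M}_n \le b_n - \mathcal{M}_n + O\left(\frac{1}{n}\right). \]

We have

\[ \begin{cases} -1 + \sqrt{\frac{(m-n)^2}{4} + \log_q N} - \sqrt{\frac{(m-n)^2}{4} + \log_q N + \frac{m+n}{2} + \lambda(n)} \le a_n - \mathcal{M}_n, \\ 1 + \sqrt{\frac{(m-n)^2}{4} + \log_q N} - \sqrt{\frac{(m-n)^2}{4} + \log_q N - (m+n) - \log_q \mu(n)} \ge b_n - \mathcal{M}_n. \end{cases} \]

For the integers satisfying the hypotheses of the theorem, in both cases the difference between
radicals has a finite limit when n tends to infinity. Therefore the difference between radicals is equal
to O(1).

One can equally check that the quantity $a_n$ does not increase faster than $n^2$ for $m \ge n$. This
implies that $a_n / n^2 = O(1)$. Therefore

\[ E(d) = \mathcal{M}_n + O(1). \tag{5.17} \]

The variance is given by $Var(d) = E(d^2) - E(d)^2$. From (5.17) we have

\[ E(d)^2 = \mathcal{M}_n^2 + O(\mathcal{M}_n), \]

and from the definition of $E(d^2)$,

\[ E(d^2) = \underbrace{\sum_{i=1}^{a_n} i^2 p_i}_{= O(1)} + \underbrace{\sum_{i=b_n}^{n} i^2 p_i}_{= O(1)} + \sum_{i=a_n+1}^{b_n-1} i^2 p_i. \]

Using the same approach as before, under the conditions of the theorem, $\mathcal{M}_n$ tends to infinity
with n. Therefore O(1) is negligible compared to $O(\mathcal{M}_n)$, and

\[ a_n^2 - \mathcal{M}_n^2 + O(\mathcal{M}_n) \le Var(d) \le b_n^2 - \mathcal{M}_n^2 + O(\mathcal{M}_n). \]

Moreover $a_n^2 - \mathcal{M}_n^2 = (a_n + \mathcal{M}_n) O(1) = O(\mathcal{M}_n)$ and $b_n^2 - \mathcal{M}_n^2 = (b_n + \mathcal{M}_n) O(1) = O(\mathcal{M}_n)$, which
gives the desired result.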



6 Maximum Rank Distance codes 

The Singleton inequality provides an upper bound on the cardinality of codes with given parameters.
We call optimal codes, or MRD (Maximum Rank Distance) codes, the codes satisfying the Singleton
equality.

Definition 6 (MRD-codes (U) 

A $(n, M, d)_r$-code over $GF(q^m)$ is called MRD if

• $M = q^{m(n-d+1)}$, if $n \le m$;

• $M = q^{n(m-d+1)}$, if $n > m$.

From this definition it follows that, whenever a code is MRD, the corresponding transposed code
is also MRD.
6.1 Rank weight distribution of MRD-codes

In Hamming metric, the weight distribution of MDS-codes is well-known ^H|. Gabidulin showed that
the rank distribution of MRD-codes in rank metric could be expressed by the following formula.

Proposition 6 ([8])

Let $A_s(n, d)$ be the number of codewords of rank s of a MRD-code over $GF(q^m)$. Then

\[ A_{d+\ell}(n, d) = \begin{bmatrix} n \\ d+\ell \end{bmatrix} \sum_{t=0}^{\ell} (-1)^{t+\ell} \begin{bmatrix} d+\ell \\ d+t \end{bmatrix} q^{(\ell-t)(\ell-t-1)/2} \left( q^{m(t+1)} - 1 \right), \tag{6.18} \]

where

\[ \begin{bmatrix} n \\ i \end{bmatrix} = \prod_{j=0}^{i-1} \frac{q^n - q^j}{q^i - q^j} \]

is the Gaussian binomial.
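The rank distribution of an MRD-code can be tabulated directly. The sketch below implements the formula in the form commonly attributed to Gabidulin, written for $n \le m$; this specific form and the helper names are assumptions of the illustration. A simple consistency check is that the numbers of codewords of all ranks add up to the code size $q^{m(n-d+1)}$.

```python
def gaussian_binomial(n, k, q):
    """Number of k-dimensional subspaces of an n-dimensional space over GF(q)."""
    if k < 0 or k > n:
        return 0
    num, den = 1, 1
    for j in range(k):
        num *= q**n - q**j
        den *= q**k - q**j
    return num // den


def mrd_rank_distribution(q, m, n, d):
    """List [A_0, ..., A_n] of an MRD code of length n over GF(q^m), assuming n <= m."""
    dist = [0] * (n + 1)
    dist[0] = 1  # the zero codeword
    for ell in range(n - d + 1):
        s = d + ell
        inner = 0
        for t in range(ell + 1):
            inner += (
                (-1) ** (t + ell)
                * gaussian_binomial(s, d + t, q)
                * q ** ((ell - t) * (ell - t - 1) // 2)
                * (q ** (m * (t + 1)) - 1)
            )
        dist[s] = gaussian_binomial(n, s, q) * inner
    return dist
```

For instance, `mrd_rank_distribution(2, 3, 3, 2)` gives `[1, 0, 49, 14]`, which sums to $2^{3 \cdot 2} = 64$; with d = 1 the code is the whole space and the distribution reduces to the sphere volumes.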



Our contribution to this section comes from the simulations we made to evaluate the degree of
randomness of MRD-codes. Through these simulations we obtained that the rank distribution
of random GF(q)-linear codes in rank metric is almost identical to the weight distribution of
linear MRD-codes. Results are presented in Table 1. The table gives the base 2 logarithm of the
proportion $A_i(32)/2^{mn}$ for $m \ge 32$. The left-most curve corresponds to m = 32, the right-most
to m = 40. We made simulations for random GF(q)-linear codes as well as for sufficiently large
MRD-codes with the same parameters. For ranks significantly greater than the minimum rank
distance both curves coincide very accurately.



Table 1: Base 2 logarithm of the proportion of words of given rank in MRD-codes of length n = 32
over $GF(2^m)$, where m = 32, 33, 35, 40.



6.2 Covering density of MRD codes 

In a previous section we proved that no perfect codes exist in rank metric. A natural question
is then how far MRD-codes are from being perfect: given an $(n, M, d)_r$ MRD-code,
what is the volume of the space covered by the balls of radius $\lfloor (d-1)/2 \rfloor$ centered on the
codewords, compared to the volume of the whole space? The covering density of the code is thus defined by

$$D = \frac{M \, \mathcal{B}_t}{q^{mn}},$$

where $t = \lfloor (d-1)/2 \rfloor$ and $\mathcal{B}_t$ denotes the volume of a ball of radius $t$. By using the bounds
on the volume of the balls established earlier, we immediately get the following proposition.



Proposition 7 (Density of MRD-codes) 

Let $\mathcal{C}$ be an $(n, q^{m(n-2t)}, 2t+1)_r$ MRD-code over $GF(q^m)$. The covering density of $\mathcal{C}$ satisfies

$$\frac{1}{q^{(m-n+2)t+t^2}} \le D < \frac{1}{q^{(m-n-1)t+t^2}}.$$

The proposition shows that whenever the length of the code equals the extension degree, i.e.
n = m, which can be the case for square-matrix codes, and n tends to $\infty$, the covering
density remains lower bounded by the quantity $q^{-t^2-2t}$, which depends only on the rank
error-correcting capability of the code.
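
The bounds of Proposition 7 can be checked numerically on small parameters. The following Python sketch is only an illustration under stated assumptions: the covering density is taken to be D = M B_t / q^{mn}, where B_t counts the m x n matrices over GF(q) of rank at most t, and the bounds are read as q^{-((m-n+2)t+t^2)} <= D < q^{-((m-n-1)t+t^2)}. All function names are ours.

```python
from fractions import Fraction


def matrices_of_rank(q, m, n, s):
    """Number of m x n matrices of rank s over GF(q)."""
    num = den = 1
    for i in range(s):
        num *= (q ** m - q ** i) * (q ** n - q ** i)
        den *= q ** s - q ** i
    return num // den


def ball_volume(q, m, n, t):
    """Number of m x n matrices over GF(q) of rank at most t."""
    return sum(matrices_of_rank(q, m, n, s) for s in range(t + 1))


def mrd_density(q, m, n, t):
    """D = M * B_t / q^(mn) with M = q^(m(n-2t)), i.e. B_t / q^(2mt)."""
    return Fraction(ball_volume(q, m, n, t), q ** (2 * m * t))


def density_bounds(q, m, n, t):
    """Lower and upper bounds on D, as read from Proposition 7."""
    lower = Fraction(1, q ** ((m - n + 2) * t + t * t))
    upper = Fraction(1, q ** ((m - n - 1) * t + t * t))
    return lower, upper


for (q, m, n, t) in [(2, 5, 5, 1), (2, 5, 5, 2), (2, 7, 5, 2), (3, 3, 3, 1)]:
    lower, upper = density_bounds(q, m, n, t)
    D = mrd_density(q, m, n, t)
    print((q, m, n, t), float(lower), float(D), float(upper), lower <= D < upper)
```

Exact rational arithmetic (`fractions.Fraction`) is used so that the comparison with the bounds is not affected by rounding.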

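Particular case of rank 1 correcting MRD codes. For 1-error correcting codes where m = n,
we can express the volume of the balls exactly and obtain the following.

Proposition 8

An $(n, q^{n(n-2)}, 3)_r$ MRD-code over $GF(q^n)$ has a covering density equal to

$$D = \frac{1 - 2q^{-n} + q^{1-2n}}{q - 1}. \qquad (6.19)$$

Indeed, a ball of radius 1 contains the zero matrix and the $(q^n - 1)^2/(q - 1)$ matrices of rank 1.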

There is a special interest in the binary case. We proved in a previous section that there are no
perfect codes in rank metric. We deduce the following corollary from the previous proposition.

Corollary 3

Let $\mathcal{F} = \{\mathcal{C}_i\}_{i > 2}$ be a family of $(i, 2^{i(i-2)}, 3)_r$ MRD-codes over $GF(2^i)$. If $D_i$ is the covering
density of code $\mathcal{C}_i$, then

$$\lim_{i \to \infty} D_i = 1.$$

This means that $\mathcal{F}$ is a sequence of codes with increasing length and alphabet that are
asymptotically perfect. We say that the codes in $\mathcal{F}$ are quasi-perfect.
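
The convergence can be observed directly. The Python sketch below is an illustration only: it assumes that the covering density of a 1-error correcting MRD-code with m = n is the volume of a ball of radius 1 (the zero matrix plus the n x n matrices of rank 1) divided by q^{2n}. The function name is ours.

```python
from fractions import Fraction


def density_one_error(q, n):
    """Covering density of an (n, q^(n(n-2)), 3)_r MRD-code over GF(q^n).

    Assumption: D = B_1 / q^(2n), with B_1 = 1 + (q^n - 1)^2 / (q - 1).
    """
    ball = 1 + (q ** n - 1) ** 2 // (q - 1)
    return Fraction(ball, q ** (2 * n))


# Binary case: the density approaches 1 as the length i grows.
for i in (3, 5, 10, 20):
    print(i, float(density_one_error(2, i)))
```

For q > 2 the same function approaches 1/(q - 1) rather than 1, which is why the binary case is singled out.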

6.3 A family of decodable optimal codes: Gabidulin codes 

In the previous sections, we established results on MRD-codes without knowing whether codes with
such optimal properties exist. They do: a family of linear MRD-codes was constructed by
E. M. Gabidulin in 1985, see [H]. In 1991, R. M. Roth also designed a family of MRD-codes for
rank metric in the form of m x n matrix codes. It turned out that both constructions are equivalent:
the codes defined by R. M. Roth can be obtained by considering the matricial form over the base field
of the words of some Gabidulin codes [23].

Definition 7 (Gabidulin code) 

Let $g = (g_1, \ldots, g_n)$ be a vector of elements of $GF(q^m)$ that are linearly independent over
$GF(q)$, and let

$$G = \begin{pmatrix} g_1 & \cdots & g_n \\ \vdots & \ddots & \vdots \\ g_1^{[k-1]} & \cdots & g_n^{[k-1]} \end{pmatrix}, \qquad (6.20)$$

where $[i] \stackrel{\mathrm{def}}{=} q^i$. The code generated by $G$ is the k-dimensional Gabidulin code $Gab_k(g)$. The vector $g$ is
called the generating vector of the code.
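
To make the definition concrete, the following Python sketch builds the generator matrix of a small Gabidulin code over GF(2^3) and computes its minimum rank distance by exhaustive search. It is a toy illustration under our own choices: the field is represented with the irreducible polynomial x^3 + x + 1, elements are 3-bit integers whose bits are coordinates over GF(2), and g = (1, a, a^2) where a is a root of that polynomial. All function names are ours.

```python
from itertools import product

M_EXT = 3          # extension degree m, so the field is GF(2^3)
MODULUS = 0b1011   # x^3 + x + 1, irreducible over GF(2)


def gf_mul(a, b):
    """Multiply two elements of GF(2^3) given as 3-bit integers."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        b >>= 1
        a <<= 1
        if a >> M_EXT:      # degree reached m: reduce modulo MODULUS
            a ^= MODULUS
    return result


def frobenius(a, i):
    """Return a^[i] = a^(2^i) by squaring i times (here q = 2)."""
    for _ in range(i):
        a = gf_mul(a, a)
    return a


def rank_gf2(vectors):
    """Rank over GF(2) of integers viewed as bit vectors."""
    pivots = {}             # leading bit position -> reduced vector
    for v in vectors:
        while v:
            top = v.bit_length() - 1
            if top not in pivots:
                pivots[top] = v
                break
            v ^= pivots[top]
    return len(pivots)


def gabidulin_generator(g, k):
    """Rows are g^[0], g^[1], ..., g^[k-1], as in (6.20)."""
    return [[frobenius(gj, i) for gj in g] for i in range(k)]


def encode(message, G):
    """Codeword message * G over GF(2^3)."""
    word = [0] * len(G[0])
    for u, row in zip(message, G):
        for j, entry in enumerate(row):
            word[j] ^= gf_mul(u, entry)
    return word


def min_rank_distance(G):
    """Smallest rank of a nonzero codeword (exhaustive search)."""
    field = range(1 << M_EXT)
    return min(rank_gf2(encode(msg, G))
               for msg in product(field, repeat=len(G)) if any(msg))


g = [1, 2, 4]                    # 1, a, a^2: linearly independent over GF(2)
G = gabidulin_generator(g, 2)
print(G)                         # [[1, 2, 4], [1, 4, 6]]
print(min_rank_distance(G))      # 2 = n - k + 1, so the code is MRD
```

Exhaustive search is only feasible for such toy parameters. It is used here to check the MRD property, not as a decoding method.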

Gabidulin codes are evaluation codes of so-called linearized polynomials, also called q-polynomials
or Ore polynomials since they were first introduced by O. Ore in the 1930s, see ^3 120] .
Gabidulin codes share many similarities with the well-known family of Reed-Solomon codes. In
particular, the polynomial-time decoding algorithms are based on some extended Euclidean algorithm
for linearized polynomials or on a Welch-Berlekamp approach. The interested reader can refer
to the historical genesis of the algorithms (HI El EH EH El EH ■ 

From the previous sections we know their rank distribution, as well as their covering density. Since
Gabidulin codes are MRD-codes, we can show the following.

Proposition 9

For all $i > 2$, $Gab_{i-2}(g_i)$ is a quasi-perfect code, provided the vector $g_i \in GF(2^i)^i$ consists of
components that are linearly independent over $GF(2)$.

Proof.

Consider the family $\mathcal{F} = \{Gab_{i-2}(g_i)\}_{i>2}$, where $g_i \in GF(2^i)^i$ is formed with linearly
independent components. Each $Gab_{i-2}(g_i)$ is an MRD-code of length $i$ and dimension $i-2$ over
$GF(2^i)$, hence an $(i, 2^{i(i-2)}, 3)_r$ code, so the family $\mathcal{F}$ satisfies the conditions of Corollary 3.



7 Conclusion 

In this paper we presented general results about rank metric and about the existence and properties of
codes in this metric. Problems and properties of such a metric have not been explored as intensively
as those of Hamming metric. Some interesting problems already solved for Hamming metric
remain unsolved. For example, we do not know any equivalent of the MacWilliams transforms,
which link the minimum rank distance of a linear code and the minimum rank distance of its dual.
Some steps towards this achievement can be found in a work published by Delsarte [jjj. In that 
paper, matrices with coefficients in a finite field are considered as matrices of quadratic forms, and
the notion of orthogonality is defined through the trace of a product of matrices. With this approach,
the author established an equivalent of the MacWilliams transforms. However, there is no simple
transformation linking the standard scalar product over finite fields and the trace of the product
of the corresponding matrices.

Another interesting point to investigate could be to construct codes with a polynomial-time 
decoding algorithm which are not designed on the basis of the codes introduced in [H1E3I. Indeed, 
even subspace subcodes or subfield subcodes of these codes are isomorphic to codes from the same 
family with smaller parameters, see E3- Projecting the codes does not affect their structure 
contrary to the case of Hamming metric, where many interesting and widely used codes are
subfield subcodes of Reed-Solomon codes.